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2 
3 



SEQUENCE LISTING 



4 (1) GENERAL INFORMATION 
5 

6 (i) APPLICANT: Darrell Anderson, Nabil Hanna, John Leonard, Roland Newman and Mitchell R 
7 

8 (ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF CHIMERIC ANTIBODY TO HUMAN B LYMPHOCY 
9 

10 (iii) NUMBER OF SEQUENCES: 8 
11 

12 (iv) CORRESPONDING ADDRESS: 
13 

14 (A) ADDRESSEE: IDEC Pharmaceuticals Corporation 

15 (B) STREET: 11099 N. Torrey Pines Road, #160 

16 (C) CITY: La Jolla 

17 (D) STATE: California 

18 (E) COUNTRY: USA 

19 (F)ZIP:92037 
20 

21 (v) COMPUTER READABLE FORM: 
22 

23 (A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

24 (B) COMPUTER: Macintosh 

25 (C) OPERATING SYSTEM: MS. DOS 

26 (D) SOFTWARE: Microsoft Word 5.0 
27 

28 (vi) CURRENT APPLICATION DATA: 
29 

30 (A) APPLICATION NUMBER: 07/978,891 

31 (B) FILING DATE: 13 NOV 1992 

32 (C) CLASSIFICATION: 424 
33 

34 (viii) ATTORNEY/AGENT INFORMATION: 
35 

36 (A) NAME: Burgoon, Richard P. Jr. 

37 (B) REGISTRATION NUMBER: 34,787 

38 (C) REFERENCE/DOCKET NUMBER: 
39 

40 (ix) TELECOMMUNICATION INFORMATION: 
41 

42 (A) TELEPHONE : (619) 458-0600 

43 (B) TELEFAX: (619) 546-9274 
44 

45 (2) INFORMATION FOR SEQ ID NO: 1: 
46 

47 (i) SEQUENCE CHARACTERISTICS: 
48 

49 (A) LENGTH: 8540 bases 

50 (B) TYPE : nucleic acid 

51 (C) STRANDEDNESS: single 

52 (D) TOPOLOGY: circular 
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53 

54 (ii) MOLECULE TYPE: DNA (genomic) 
55 

56 (iii) HYPOTHETICAL: no 
57 

58 (iv)ANTI- SENSE: no 
59 

60 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
61 

62 GACGTCGCGG CCGCTCTA6G CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

63 

64 AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 
65 

66 GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 
67 

68 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 
69 

70 GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 
71 

72 GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 

73 

74 AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 
75 

76 GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 
77 

78 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 
79 

80 TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 
81 

82 AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 
83 

84 ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 
85 

86 ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 
87 

88 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 
89 

90 GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 
91 

92 CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 
93 

94 CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 
95 

96 CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 
97 

98 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 
99 

100 TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 
101 

102 CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 

103 

104 AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1320 
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TGCGAAGTCA 
TGTTGAATTC 
GGCCGTGATA 
GTTTGCCCCT 
TAATAAAATG 
GGGGTGGGGC 
GCGGTGGGCT 
ACGTCAATGA 
TTCCTACTTG 
GGCAGTACAT 
CCATTGACGT 
GTAACAACTC 
TAAGCAGAGC 
ATGGGTTGGA 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCCTCTTCC 
TGCGTGGTGG 
GGCGTGGAGG 
CGTGTGGTCA 
TGCAAGGTCT 
GGGCAGCCCC 



CCCATCAGGG 
AGATCCGTTA 
TCTACGTATG 
CCCCCGTGCC 
AGGAAATTGC 
AGGACAGCAA 
CTATGGAACC 
CGGTAAATGG 
GCAGTACATC 
CAATGGGCGT 
CAATGGGAGT 
CGCCCCATTG 
TGGGTACGTC 
GCCTCATCTT 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
CCCCAAAACC 
TGGACGTGAG 
TGCATAATGC 
GCGTCCTCAC 
CCAACAAAGC 
GAGAACCACA 



CCTGAGCTCG 
ACGGTTACCA 
ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
GGGGGAGGAT 
AGCTGGGGCT 
CCCGCCTGGC 
TACGTATTAG 
GGATAGCGGT 
TTGTTTTGGC 
ACGCAAATGG 
CTCACATTCA 
GCTCTTCCTT 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
CAAGGACACC 
CCACGAAGAC 
CAAGACAAAG 
CGTCCTGCAC 
CCTCCCAGCC 
GGTGTACACC 



CCCGTCACAA 
ACTACCTAGA 
ACTGTGCCTT 
CTGGAAGGTG 
CTGAGTAGGT 
TGGGAAGACA 
CGACAGCTAT 
ATTATGCCCA 
TCATCGCTAT 
TTGACTCACG 
ACCAAAATCA 
GCGGTAGGCG 
GTGATCAGCA 
GTCGCTGTTG 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
GCACCTGAAC 
CTCATGATCT 
CCTGAGGTCA 
CCGCGGGAGG 
CAGGACTGGC 
CCCATCGAGA 
CTGCCCCCAT 



AGAGCTTCAA 
CTGGATTCGT 
CTAGTTGCCA 
CCACTCCCAC 
GTCATTCTAT 
ATAGCAGGCA 
GCCAAGTACG 
GTACATGACC 
TACCATGGTG 
GGGATTTCCA 
ACGGGACTTT 
TGTACGGTGG 
CTGAACACAG 
CTACGCGTGT 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGCAGAGCC 
TCCTGGGGGG 
CCCGGACCCC 
AGTTCAACTG 
AGCAGTACAA 
TGAATGGCAA 
AAACCATCTC 
CCCGGGATGA 



CAGGGGAGAG 
GACAACATGC 
GCCATCTGTT 
TGTCCTTTCC 
TCTGGGGGGT 
TGCTGGGGAT 
CCCCCTATTG 
TTATGGGACT 
ATGCGGTTTT 
AGTCTCCACC 
CCAAAATGTC 
GAGGTCTATA 
ACCCGTCGAC 
CGCTAGCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
ACCGTCAGTC 
TGAGGTCACA 
GTACGTGGAC 
CAGCACGTAC 
GGAGTACAAG 
CAAAGCCAAA 
GCTGACCAAG 
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157 

158 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 2940 
159 

160 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 3000 
161 

162 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 3060 
163 

164 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 3120 
165 

166 CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT 3180 
167 

168 TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT 3240 
169 

170 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3300 
171 

172 CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3360 
173 

174 CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 3420 
175 

176 GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT 3480 
177 

178 CCCGATCCCC AGCTTTGCTT CTCAATTXCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3540 
179 

180 AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3600 
181 

182 AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3660 
183 

184 GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3720 
185 

186 GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 3780 
187 

188 GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3840 
189 

190 CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3900 
191 

192 CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3960 
193 

194 GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4020 
195 

196 GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 
197 

198 TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 4140 
199 

200 CCTGAGAAGA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 4200 
201 

202 GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 4260 
203 

204 GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 4320 
205 

206 TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 4380 
207 

208 GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 
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209 

210 GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4500 
211 

212 GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 
213 

214 CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620 
215 

216 TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4680 
217 

218 TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 4740 
219 

220 GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 
221 

222 CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 
223 

224 TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4920 
225 

226 ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980 
227 

228 TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 
229 

230 GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 
231 

232 CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 
233 

234 AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 
235 

236 TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 
237 

238 TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 5340 
239 

240 CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 
241 

242 CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 5460 
243 

244 CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 5520 
245 

246 GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 5580 
247 

248 GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5640 
249 

250 CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 5700 
251 

252 TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 5760 
253 

254 CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 5820 
255 

256 CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 
257 

258 GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 5940 
259 

260 GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 6000 
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GCAGCGCATC 
GAAATGACCG 
TTCTATGAAA 
CGCGGGGATC 
GGTTACAAAT 
TCTAGTTGTG 
ATCCCGTCGA 
CGCTCACAAT 
AATGAGTGAG 
ACCTGTCGTG 
TTGGGCGCTC 
GAGCGGTATC 
CAGGAAAGAA 
TGCTGGCGTT 
GTCAGAGGTG 
CCCTCGTGCG 
CTTCGGGAAG 
TCGTTCGCTC 
TATCCGGTAA 
CAGCCACTGG 
AGTGGTGGCC 
AGCCAGTTAC 
GTAGCGGTGG 
AAGATCCTTT 
GGATTTTGGT 
GAAGTTTTAA 



GCCTTCTATC 
ACCAAGCGAC 
GGTTGGGCTT 
TCATGCTGGA 
AAAGCAATAG 
GTTTGTCCAA 
GAGCTTGGCG 
TCCACACAAC 
CTAACTCACA 
CCAGCTGCAT 
TTCCGCTTCC 
AGCTCACTCA 
CATGTGAGCA 
TTTCCATAGG 
GCGAAACCCG 
CTCTCCTGTT 
CGTGGCGCTT 
CAAGCTGGGC 
CTATCGTCTT 
TAACAGGATT 
TAACTACGGC 
CTTCGGAAAA 
TTTTTTTGTT 
GATCTTTTCT 
CATGAGATTA 
ATCAATCTAA 



GCCTTCTTGA 
GCCCAACCTG 
CGGAATCGTT 
GTTCTTCGCC 
CATCACAAAT 
ACTCATCAAT 
TAATCATGGT 
ATACGAGCCG 
TTAATTGCGT 
TAATGAATCG 
TCGCTCACTG 
AAGGCGGTAA 
AAAGGCCAGC 
CTCCGCCCCC 
ACAGGACTAT 
CCGACCCTGC 
TCTCAATGCT 
TGTGTGCACG 
GAGTCCAACC 
AGCAGAGCGA 
TACACTAGAA 
AGAGTTGGTA 
TGCAAGCAGC 
ACGGGGTCTG 
TCAAAAAGGA 
AGTATATATG 



CGAGTTCTTC 
CCATCACGAG 
TTCCGGGACG 
CACCCCAACT 
TTCACAAATA 
CTATCTTATC 
CATAGCTGTT 
GAAGCATAAA 
TGCGCTCACT 
GCCAACGCGC 
ACTCGCTGCG 
TACGGTTATC 
AAAAGGCCAG 
CTGACGAGCA 
AAAGATACCA 
CGCTTACCGG 
CACGCTGTAG 
AACCCCCCGT 
CGGTAAGACA 
GGTATGTAGG 
GGACAGTATT 
GCTCTTGATC 
AGATTACGCG 
ACGCTCAGTG 
TCTTCACCTA 
AGTAAACTTG 



TGAGCGGGAC 
ATTTCGATTC 
CCGGCTGGAT 
TGTTTATTGC 
AAGCATTTTT 
ATGTCTGGAT 
TCCTGTGTGA 
GTGTAAAGCC 
GCCCGCTTTC 
GGGGAGAGGC 
CTCGGTCGTT 
CACAGAATCA 
GAACCGTAAA 
TCACAAAAAT 
GGCGTTTCCC 
ATACCTGTCC 
GTATCTCAGT 
TCAGCCCGAC 
CGACTTATCG 
CGGTGCTACA 
TGGTATCTGC 
CGGCAAACAA 
CAGAAAAAAA 
GAACGAAAAC 
GATCCTTTTA 
GTCTGACAGT 



TCTGGGGTTC 
CACCGCCGCC 
GATCCTCCAG 
AGCTTATAAT 
TTCACTGCAT 
CGCGGCCGCG 
AATTGTTATC 
TGGGGTGCCT 
CAGTCGGGAA 
GGTTTGCGTA 
CGGCTGCGGC 
GGGGATAACG 
AAGGCCGCGT 
CGACGCTCAA 
CCTGGAAGCT 
GCCTTTCTCC 
TCGGTGTAGG 
CGCTGCGCCT 
CCACTGGCAG 
GAGTTCTTGA 
GCTCTGCTGA 
ACCACCGCTG 
GGATCTCAAG 
TCACGTTAAG 
AATTAAAAAT 
TACCAATGCT 
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313 

314 TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7620 
315 

316 TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7680 
317 

318 TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 7740 
319 

320 GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 7800 
321 

322 GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 7860 
323 

324 TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7920 
325 

326 CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7980 
327 

328 TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 8040 
329 

330 CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100 
331 

332 AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160 
333 

334 CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 8220 
335 

336 AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 8280 
337 

338 AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 8340 
339 

340 GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 8400 
341 

342 GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 8460 
343 

344 TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 8520 
345 

346 TTCCCCGAAA AGTGCCACCT 8540 

347 

348 

349 (3) INFORMATION FOR SEQ ID NO: 2: 
350 

351 (i) SEQUENCE CHARACTERISTICS: 
352 

353 (A) LENGTH: 9209 bases 

354 (B) TYPE : nucleic acid 

355 (C) STRANDEDNESS : single 

356 (D) TOPOLOGY: circular 
357 

358 (ii) MOLECULE TYPE: DNA (genomic) 
359 

360 (iii) HYPOTHETICAL: no 
361 

362 ( iv) ANT I - SENSE : no 
363 

364 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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365 
366 
367 
368 
369 
370 
371 
372 
373 
374 
375 
376 
377 
378 
379 
380 
381 
382 
383 
384 
385 
386 
387 
388 
389 
390 
391 
392 
393 
394 
395 
396 
397 
398 
399 
400 
401 
402 
403 
404 
405 
406 
407 
408 
409 
410 
411 
412 
413 
414 
415 
416 



GACGTCGCGG 
AGGCCGAGGC 
GGAGAATGGG 
ATGGTTGCTG 
GACTTTCCAC 
GGGGAGCCTG 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CATCACAGAT 
GCTTCAGTCA 
GCATCTCCAG 
CACTGGTTCC 
CTGGCTTCTG 
ACAATCAGCA 
AACCCACCCA 
TCTGTCTTCA 
TGCCTGCTGA 



CCGCTCTAGG 
GGCCTCGGCC 
CGGAACTGGG 
ACTAATTGAG 
ACCTGGTTGC 
GGGACTTTCC 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
CTCTCACTAT 
TAATGTCCAG 
GGGAGAAGGT 
AGCAGAAGCC 
GAGTCCCTGT 
GAGTGGAGGC 
CGTTCGGAGG 
TCTTCCCGCC 
ATAACTTCTA 



CCTCCAAAAA 
TCTGCATAAA 
CGGAGTTAGG 
ATGCATGCTT 
TGACTAATTG 
ACACCCTAAC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTGGG 
GGATTTTCAG 
AGGACAAATT 
CACAATGACT 
AGGATCCTCC 
TCGCTTCAGT 
TGAAGATGCT 
GGGGACCAAG 
ATCTGATGAG 
TCCCAGAGAG 



AGCCTCCTCA 
TAAAAAAAAT 
GGCGGGATGG 
TGCATACTTC 
AGATGCATGC 
TGACACACAT 
TTAGTTCATA 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TACGTGAACC 
GTGCAGATTA 
GTTCTCTCCC 
TGCAGGGCCA 
CCCAAACCCT 
GGCAGTGGGT 
GCCACTTATT 
CTGGAAATCA 
CAGTTGAAAT 
GCCAAAGTAC 



CTACTTCTGG 
TAGTCAGCCA 
GCGGAGTTAG 
TGCCTGCTGG 
TTTGCATACT 
TCCACAGAAT 
GCCCATATAT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
AAATGGGCGG 
GTCAGATCGC 
TCAGCTTCCT 
AGTCTCCAGC 
GCTCAAGTGT 
GGATTTATGC 
CTGGGACTTC 
ACTGCCAGCA 
AACGTACGGT 
CTGGAACTGC 
AGTGGAAGGT 



AATAGCTCAG 
TGCATGGGGC 
GGGCGGGACT 
GGAGCCTGGG 
TCTGCCTGCT 
TAATTCCCCT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
CTGGAGACGC 
GCTAATCAGT 
AATCCTGTCT 
AAGTTACATC 
CACATCCAAC 
TTACTCTCTC 
GTGGACTAGT 
GGCTGCACCA 
CTCTGTTGTG 
GGATAACGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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417 CTCCAATCGG GTAACTCCCA G6AGA6TGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1560 
418 

419 AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1620 
420 

421 TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1680 
422 

423 TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 1740 
424 

425 GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1800 
426 

427 GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1860 
428 

429 TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1920 
430 

431 GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1980 
432 

433 GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 2 040 
434 

435 ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 2100 
436 

437 TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 2160 
438 

439 GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 2220 
440 

441 CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 2280 
442 

443 GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2340 
444 

445 TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2400 
446 

447 ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 2460 
448 

449 GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 2520 
450 

451 TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 2580 
452 

453 GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 2 640 
454 

455 CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGCCTACATG 2700 
456 

457 CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 2760 
458 

459 TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 2820 
460 

461 GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 2880 
462 

463 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 2940 
464 

465 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 3000 
466 

467 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 3060 
468 
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469 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC 312 0 
470 

471 AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 3180 
472 

473 CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 3240 
474 

475 GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 3300 
476 

477 TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 3360 
478 

479 AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 3420 
480 

481 GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 3480 
482 

483 AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG 3540 
484 

485 CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC 3 600 
486 

487 GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG 3660 
488 

489 CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 3720 
490 

491 CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG 3780 
492 

493 CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT 3840 
494 

495 AGACTGGATT CGTGACAACA TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC 3900 
496 

497 CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 3960 
498 

499 GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 4020 
500 

501 GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 4080 
502 

503 ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGA ACCAGCTGGG GCTCGACAGC 4140 
504 

505 GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 4200 
506 

507 AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 4260 
508 

509 ATGCTTTAGA GACAGTGTTC TCTGCACAGA TAAGGACAAA CATTATTCAG AGGGAGTACC 4320 
510 

511 CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 4380 
512 

513 ATCACCGAAG CCTGATTCCG TAGAGCCACA CCTTGGTAAG GGCCAATCTG CTCACACAGG 4440 
514 

515 ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 4500 
516 

517 CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG 4560 
518 

519 ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 4620 
520 
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521 
522 
523 
524 
525 
526 
527 
528 
529 
530 
531 
532 
533 
534 
535 
536 
537 
538 
539 
540 
541 
542 
543 
544 
545 
546 
547 
548 
549 
550 
551 
552 
553 
554 
555 
556 
557 
558 
559 
560 
561 
562 
563 
564 
565 
566 
567 
568 
569 
570 
571 
572 



GCCATCATGG 
AAGAACGGAG 
ACCACAACCT 
TTCTCCATTC 
GAACTCAAAG 
AGACTTATTG 
AGTTCTGTTT 
ATCATGCAGG 
CTTCTCCCAG 
AAGTTTGAAG 
CTCCTAAAGC 
GACTGTGCCT 
CCTGGAAGGT 
TCTGAGTAGG 
TTGGGAAGAC 
TCGAGCTACT 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CGCAGGTTCT 
AATCGGCTGC 
TGTCAAGACC 
GTGGCTGGCC 



TTCGACCATT 
ACCTACCCTG 
CTTCAGTGGA 
CTGAGAAGAA 
AACCACCACG 
AACAACCGGA 
ACCAGGAAGC 
AATTTGAAAG 
AATACCCAGG 
TCTACGAGAA 
TATGCATTTT 
TCTAGTTGCC 
GCCACTCCCA 
TGTCATTCTA 
AATAGCAGGC 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
CCGGCCGCTT 
TCTGATGCCG 
GACCTGTCCG 
ACGACGGGCG 



GAACTGCATC 
GCCTCCGCTC 
AGGTAAACAG 
TCGACCTTTA 
AGGAGCTCAT 
ATTGGCAAGT 
CATGAATCAA 
TGACACGTTT 
CGTCCTCTCT 
GAAAGACTAA 
TATAAGACCA 
AGCCATCTGT 
CTGTCCTTTC 
TTCTGGGGGG 
ATGCTGGGGA 
CTCAATTTCT 
AGTTGATTGA 
ATAAGGACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATCG 
GGGTGGAGAG 
CCGTGTTCCG 
GTGCCCTGAA 
TTCCTTGCGC 



GTCGCCGTGT 
AGGAACGAGT 
AATCTGGTGA 
AAGGACAGAA 
TTTCTTGCCA 
AAAGTAGACA 
CCAGGCCACC 
TTCCCAGAAA 
GAGGTCCAGG 
CAGGAAGATG 
TGGGACTTTT 
TGTTTGCCCC 
CTAATAAAAT 
TGGGGTGGGG 
TGCGGTGGGC 
TATTTGCATA 
GCAAATGCGT 
ACATTATTCA 
CTAGGGAGAA 
GGGCCAATCT 
GGTAGGATCA 
ATCCTCTATG 
GCTATTCGGC 
GCTGTCAGCG 
TGAACTGCAG 
AGCTGTGCTC 



CCCAAAATAT 
TCAAGTACTT 
TTATGGGTAG 
TTAATATAGT 
AAAGTTTGGA 
TGGTTTGGAT 
TTAGACTCTT 
TTGATTTGGG 
AGGAAAAAGG 
CTTTCAAGTT 
GCTGGCTTTA 
TCCCCCGTGC 
GAGGAAATTG 
CAGGACAGCA 
TCTATGGAAC 
ATGAGAAAAA 
TGCCAAAAAG 
GAGGGAGTAC 
ATATGCTTGT 
GCTCACACAG 
GTTGCTCCTC 
GTTGAACAAG 
TATGACTGGG 
CAGGGGCGCC 
GACGAGGCAG 
GACGTTGTCA 



GGGGATTGGC 
CCAAAGAATG 
GAAAACCTGG 
TCTCAGTAGA 
TGATGCCTTA 
AGTCGGAGGC 
TGTGACAAGG 
GAAATATAAA 
CATCAAGTAT 
CTCTGCTCCC 
GATCAGCCTC 
CTTCCTTGAC 
CATCGCATTG 
AGGGGGAGGA 
CAGCTGGGGC 
AAGGAAAATT 
GATGCTTTAG 
CCAGAGCTGA 
CATCACCGAA 
GATAGAGAGG 
ACATTTGCTT 
ATGGATTGCA 
CACAACAGAC 
CGGTTCTTTT 
CGCGGCTATC 
CTGAAGCGGG 



4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
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573 
574 
575 
576 
577 
578 
579 
580 
581 
582 
583 
584 
585 
586 
587 
588 
589 
590 
591 
592 
593 
594 
595 
596 
597 
598 
599 
600 
601 
602 
603 
604 
605 
606 
607 
608 
609 
610 
611 
612 
613 
614 
615 
616 
617 
618 
619 
620 
621 
622 
623 
624 



AA6GGACT6G 
TCCTGCCGAG 
GGCTACCTGC 
GGAAGCCGGT 
CGAACTGTTC 
TGGCGATGCC 
CTGTGGCCGG 
TGCTGAAGAG 
TCCCGATTCG 
CTGGGGTTCG 
ACCGCCGCCT 
ATCCTCCAGC 
GCTTATAATG 
TCACTGCATT 
GCGGCCGCGA 
ATTGTTATCC 
GGGGTGCCTA 
AGTCGGGAAA 
GTTTGCGTAT 
GGCTGCGGCG 
GGGATAACGC 
AGGCCGCGTT 
GACGCTCAAG 
CTGGAAGCTC 
CCTTTCTCCC 
CGGTGTAGGT 



CTGCTATTGG 
AAAGTATCCA 
CCATTCGACC 
CTTGTCGATC 
GCCAGGCTCA 
TGCTTGCCGA 
CTGGGTGTGG 
CTTGGCGGCG 
CAGCGCATCG 
AAATGACCGA 
TCTATGAAAG 
GCGGGGATCT 
GTTACAAATA 
CTAGTTGTGG 
TCCCGTCGAG 
GCTCACAATT 
ATGAGTGAGC 
CCTGTCGTGC 
TGGGCGCTCT 
AGCGGTATCA 
AGGAAAGAAC 
GCTGGCGTTT 
TCAGAGGTGG 
CCTCGTGCGC 
TTCGGGAAGC 
CGTTCGCTCC 



GCGAAGTGCC 
TCATGGCTGA 
ACCAAGCGAA 
AGGATGATCT 
AGGCGCGCAT 
ATATCATGGT 
CGGACCGCTA 
AATGGGCTGA 
CCTTCTATCG 
CCAAGCGACG 
GTTGGGCTTC 
CATGCTGGAG 
AAGCAATAGC 
TTTGTCCAAA 
AGCTTGGCGT 
CCACACAACA 
TAACTCACAT 
CAGCTGCATT 
TCCGCTTCCT 
GCTCACTCAA 
ATGTGAGCAA 
TTCCATAGGC 
CGAAACCCGA 
TCTCCTGTTC 
GTGGCGCTTT 
AAGCTGGGCT 



GGGGCAGGAT 
TGCAATGCGG 
ACATCGCATC 
GGACGAAGAG 
GCCCGACGGC 
GGAAAATGGC 
TCAGGACATA 
CCGCTTCCTC 
CCTTCTTGAC 
CCCAACCTGC 
GGAATCGTTT 
TTCTTCGCCC 
ATCACAAATT 
CTCATCAATC 
AATCATGGTC 
TACGAGCCGG 
TAATTGCGTT 
AATGAATCGG 
CGCTCACTGA 
AGGCGGTAAT 
AAGGCCAGCA 
TCCGCCCCCC 
CAGGACTATA 
CGACCCTGCC 
CTCAATGCTC 
GTGTGCACGA 



CTCCTGTCAT 
CGGCTGCATA 
GAGCGAGCAC 
CATCAGGGGC 
GAGGATCTCG 
CGCTTTTCTG 
GCGTTGGCTA 
GTGCTTTACG 
GAGTTCTTCT 
CATCACGAGA 
TCCGGGACGC 
ACCCCAACTT 
TCACAAATAA 
TATCTTATCA 
ATAGCTGTTT 
AAGCATAAAG 
GCGCTCACTG 
CCAACGCGCG 
CTCGCTGCGC 
ACGGTTATCC 
AAAGGCCAGG 
TGACGAGCAT 
AAGATACCAG 
GCTTACCGGA 
ACGCTGTAGG 
ACCCCCCGTT 



CTCACCTTGC 

CGCTTGATCC 

GTACTCGGAT 

TCGCGCCAGC 

TCGTGACCCA 

GATTCATCGA 

CCCGTGATAT 

GTATCGCCGC 

GAGCGGGACT 

TTTCGATTCC 

CGGCTGGATG 
GTTTATTGCA 

AGCATTTTTT 
TGTCTGGATC 
CCTGTGTGAA 
TGTAAAGCCT 
CCCGCTTTCC 
GGGAGAGGCG 
TCGGTCGTTC 
ACAGAATCAG 
AACCGTAAAA 
CACAAAAATC 
GCGTTTCCCC 
TACCTGTCCG 
TATCTCAGTT 
CAGCCCGACC 



6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
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625 GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 7800 
626 

627 CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 7860 
628 

629 AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 7920 
630 

631 CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA 7980 
632 

633 CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG 8040 
634 

635 GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT 8100 
636 

637 CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA 8160 
638 

639 ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 8220 
640 

641 ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG 8280 
642 

643 TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 8340 
644 

645 GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 8400 
646 

647 AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 8460 
648 

649 CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 8520 
650 

651 TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA 8580 
652 

653 GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 8640 
654 

655 TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA 8700 
656 

657 TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG 8760 
658 

659 TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT 8820 
660 

661 CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA 8880 
662 

663 TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA 8940 
664 

665 GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 9000 
666 

667 TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 9060 
668 

669 GGAAATGTXG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 9120 
670 

671 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC 9180 
672 

673 CGCGCACATT TCCCCGAAAA GTGCCACCT 9209 

674 

675 

676 (4) INFORMATION FOR SEQ ID NO: 3: 
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677 

678 (i) SEQUENCE CHARACTERISTICS: 
679 

680 (A) LENGTH: 47 bases 

681 (B) TYPE: nucleic acid 

682 (C) STRANDEDNESS : single 

683 (D) TOPOLOGY: linear 
684 

685 ( i i ) MOLECULE TYPE: DNA (genomic) 
686 
687 
688 




(iii) HYPOTHETICAL: no ^S\C$S k " ^ 

689 (iv)ANTI- SENSE: no ^S>W\ /f^~~ 



690 

691 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3: I/O 

692 lU 

693 ATC ACA GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT45 

694 TC47 



695 /T5*) INFORMATION FOR SEQ ID NO: 4: 

696 ^ocjl. 

697 (i) SEQUENCE CHARACTERISTICS: 
698 

699 (A) LENGTH: 30 bases 

700 (B) TYPE : nucleic acid 

701 (C) STRANDEDNESS: single 

702 (D) TOPOLOGY: linear 
703 

704 (ii) MOLECULE TYPE: DNA (genomic) 
705 

706 (iii) HYPOTHETICAL: no 
707 

708 (iv)ANTI- SENSE: yes 
709 

710 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
711 

712 TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT 30 
713 

714 (6) INFORMATION FOR SEQ ID NO: 5: 
715 

716 (i) SEQUENCE CHARACTERISTICS: 
717 

718 (A) LENGTH: 384 bases 

719 (B) TYPE : nucleic acid 

720 (C) STRANDEDNESS: single 

721 (D) TOPOLOGY: linear 
722 

723 (ii) MOLECULE TYPE : DNA (genomic) 
724 

725 (iii) HYPOTHETICAL: no 
726 

727 (iv)ANTI- SENSE: no 
728 
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729 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

730 

731 



732 


ATG 


GAT 


TTT 


CAG 


GTG 


CAG 


ATT 


ATC 


AGC 


TTC 


CTG 


CTA 


ATC 


AGT 


GCT 


TCA 


733 


































734 


ATA 


ATG 


TCC 


AGA 


GGG 


CAA 


ATT 


GTT 


CTC 


TCC 


CAG 


TCT 


CCA 


GCA 


ATC 


CTG 


735 


































736 


GCA 


TCT 


CCA 


GGG 


GAG 


AAG 


GTC 


ACA 


ATG 


ACT 


TGC 


AGG 


GCC 


AGC 


TCA 


AGT 


737 


































738 


AGT 


TAC 


ATC 


CAC 


TGG 


TTC 


CAG 


CAG 


AAG 


CCA 


GGA 


TCC 


TCC 


CCC 


AAA 


CCC 


739 


































740 


ATT 


TAT 


GCC 


ACA 


TCC 


AAC 


CTG 


GCT 


TCT 


GGA 


GTC 


CCT 


GTT 


CGC 


TTC 


AGT 


741 


































742 


AGT 


GGG 


TCT 


GGG 


ACT 


TCT 


TAC 


TCT 


CTC 


ACA 


ATC 


AGC 


AGA 


GTG 


GAG 


GCT 


743 


































744 


GAT 


GCT 


GCC 


ACT 


TAT 


TAC 


TGC 


CAG 


CAG 


TGG 


ACT 


AGT 


AAC 


CCA 


CCC 


ACG 


745 


































746 


GGA 


GGG 


GGG 


ACC 


AAG 


CTG 


GAA 


ATC 


AAA384 
















747 
748 

749 (7) INFORMATION FOR SEQ ID NO: 
750 

751 (i) SEQUENCE CHARACTERISTICS: 
752 

753 (A) LENGTH: 27 bases 

754 (B) TYPE: nucleic acid 

755 (C)STRANDEDNESS: single 

756 (D) TOPOLOGY: linear 
757 

758 (ii) MOLECULE TYPE: DNA (genomic) 
759 

760 (iii) HYPOTHETICAL: no 
761 

762 (iv)ANTI- SENSE: no 
763 

764 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
765 

766 GCG GCT CCC ACG CGT GTC CTG TCC CA§27 
767 

768 (8) INFORMATION FOR SEQ ID NO: 
769 

770 (i) SEQUENCE CHARACTERISTICS: 
771 

772 (A) LENGTH: 29 bases 

773 (B) TYPE : nucleic acid 

774 (C) STRANDEDNESS: single 

775 (D) TOPOLOGY: linear 
776 

777 ( ii ) MOLECULE TYPE: DNA (genomic) 
778 

779 (iii) HYPOTHETICAL: no 
780 



: CAG2" 
7: 
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781 (iv)ANTI- SENSE: yes 
782 

783 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 
784 

785 66S TGT TGT GCT AGC TGM RGA GAC RGT GA29 
786 

787 (9) INFORMATION FOR SEQ ID NO: 8: 
788 

789 (i) SEQUENCE CHARACTERISTICS: 
790 

791 (A) LENGTH: 420 bases 

792 (B) TYPE: nucleic acid 

793 (C) STRANDEDNESS : single 

794 (D) TOPOLOGY: linear 
795 

796 (ii) MOLECULE TYPE: DNA (genomic) 
797 

798 (iii) HYPOTHETICAL: no 
799 

800 (iv)ANTI- SENSE: no 
801 

802 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

804 ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT GTC 51 
805 

806 CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG CCT GGG102 
807 

808 GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT ACC AGT TAC153 
809 

810 AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG GAA TGG ATT GGA 204 

812 GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT CAG AAG TTC AAA GGC255 
813 

814 AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAG CTC306. 

815 l^T 

816 AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC TAT TAC TGT GCA AGA TCG ACT357 

817 \s£ 

818 TAC TAC GGC GGT GAC TGG TAC TTC AAT GTC TGG GGC GCA GGG ACC ACG GTC408 

819 ACC GTC TCT GCA 420 
820 




